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Abstract 

Fix a choice and ordering of four pairwise non-adjacent vertices 
of a parallelepiped, and call a motif a sequence of four points in M 3 
that coincide with these vertices for some, possibly degenerate, paral- 
lelepiped whose edges are parallel to the axes. We show that a set of r 
points can contain at most r 2 motifs. Generalizing the notion of motif 
to a sequence of L points in K p , we show that the maximum number 
of motifs that can occur in a point set of a given size is related to 
a linear programming problem arising from hypergraph theory, and 
discuss some related questions. 



1 Introduction 

Let a motif be a quadruple of four points (a, b, c, d) in M 3 that lie on pairwise 
non-adjacent vertices of a parallelepiped with edges parallel to the x-, y-, and 
shown in Figure 1. We allow the parallelepiped to be a reflection 
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of the one shown here, or to be degenerate, so that the only constraint on a 
motif is that there are certain equality constraints between the coordinates 
of a, b, c, and d; we require the x coordinates of a and c to be the same, the 
y coordinates of b and c to be the same, and so on, as shown in the table in 
Figure 1. (The term is not to be confused with the notion of motif as used 
in algebraic geometry.) 

Given a set S of r distinct points in M 3 , how many motifs can be made 
out of the points in <S? We will show that there can be at most r 2 . Foliate M 3 
by planes parallel to the xy-plane, and suppose that there are n such planes, 
Pi, . . . , P n , say, that contain points of S, containing r 1; . . . , r n points of 
S, respectively. We may order the planes so that r x > • • • > r n > 0. Now, 
in any motif, a and b must be contained in the same plane, Pj say, and c 
and d must be in the same plane, Pj, say. If % < j, then fix j, and observe 
that a and b are determined by c, d, and i; there are then at most j ways of 
choosing i, and r| ways of choosing c and d, so our choice of j contributes at 
most jr? motifs. On the other hand, if i > j, fix i, and observe that c and 
d are determined by a, b, and j, and that there are at most % — 1 ways of 
choosing j, so our choice of i contributes at most (i — l)r 2 motifs. Therefore, 
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the total number of motifs possible is no more than 

E + E (« - w = E ( 2< - w 

l<j<n 2<i<n l<i<n 

< E r i (2r 1 + --- + 2r i _ 1 + r i ) 

l<i<n 

= (n + • • • + r n ) 2 = r 2 , as desired. 

On the other hand, if we choose 5 to be all points in a three-dimensional 
grid of A points by B points by C points, S will have size r = ABC and will 
admit A 2 B 2 C 2 = r 2 motifs. This shows that the upper bound r 2 is optimal. 

2 A generalization 

We generalize the notion of motif to L-tuples of points in IR P as follows: given 
positive integers L and p, call a motif specification onp,L a sequence 7Ti, . . . , 
7T P of partitions of subsets of {1, . . . ,L}, together with constants D irn G R 
for each pair (i,m) such that i G {1,...,L}, m G and i is in 

no block of 7r m . A tuple (/i, . . . , /&) G (M P ) L will then be a motif (for that 
specification) if: 

1. Whenever i, j G {1, ...,L} are in the same block of 7i m , for some 
m G {1, . . . ,p}, then (fi) m = (fj) m . 

2. Whenever i G {1, . . . , L} is in no block of 7r m , for some m G {1, . . . ,p}, 
then (fi) m = A m . 

Given a subset 5 C M p and a motif specification, we let A4(S) be the 
number of motifs in S, i.e., the number of L-tuples of elements of S that are 
motifs. We want to know how large JA(S) can be for a set S of given size. 
To state and prove an estimate for this, we make the following definitions: 

• Let the hypergraph of a motif specification be the hypergraph with 
vertex set {1, . . . , L} and an edge for each block of each partition 7Ti, 
. . . , Tip, such that each edge is incident on the vertices that are elements 
of the corresponding block. 

• Given a hypergraph H, let V(H) be the set of its vertices and E(H) 
be the set of its edges. We write x G e if the edge e is incident on the 
vertex x. 
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• PP A hypergraph is uniform if all its edges have the same size. 

• [3][T] A fractional transversal g of a hypergraph if is a function g : 
V(H) -> R> such that J2 xee g(x) > 1 for all e G £(if). We call 
^2 x g{x) the tota/ weight of g. 

• [3][T] The fractional transversal number, t*(H), of a hypergraph H is 
defined to be 

t*(H) := min{y^ | g a fractional transversal of H.} 

X 

• [3][T] A fractional matching f of a hypergraph if is a function / : 

-»■ M> such that J2 xee f( e ) < 1 for a11 x e V ( H )- We cal1 
X) e /( e ) ^ ne t°t a l weight of /. 

• [3][T] The fractional matching number, u*(H), of a hypergraph if is 
defined to be 

v*{H) : = max{^ /(e) | / a fractional matching of H.} 

e 

• Given any m > and any function F : M m — > R, write * for an 
argument of F in order to denote summing over that argument: 

G(*) = J>(a), 
F(*,*,c) = £>(a,&,c), 

a,6gK 

etc. 

The hypergraph of a motif specification may contain multiple edges which 
are incident on the same set of vertices. However, this need not be considered 
when computing r* and v* , since if the hypergraph H' is obtained from H 
by omitting duplicate edges, then r*{H') = r*(H) and u*(H') = v*{H). 

Theorem 1. f^fj^ For all hypergraphs H , t*(H) = v*{H). Also, if a frac- 
tional transversal and fractional matching of H have the same total weight 
w, then w = r*(H) = u*(H). 
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Proof. If M is the incidence matrix of H, t*(H) is the optimal value for the 
linear programming problem of minimizing l T x, where x > and Mx > 1. 
Also, u*(H) is the optimal value for the linear programming problem of 
maximizing l T y, where y > and M T y < 1. However, these problems are 
dual, so they have the same optimal value and t*(H) = u*(H). The second 
claim now follows from the definitions of r* and u*. □ 

In the case of our introductory example, L = 4 and p = 3, and the motif 
specification uses partitions tti = {{1, 3}, {2, 4}}, 7r 2 = {{1, 4}, {2, 3}}, and 
7r 3 = {{1, 2}, {3, 4}}. The hypergraph H is therefore a complete graph on 
4 vertices, and one fractional transversal of H is the function go that takes 
all vertices to 1/2. Similarly, one fractional matching of H is the function 
that takes all edges to 1/3. Since these have the same total weight, 2, we see 
that t*(H) = v*{H) = 2, and that g and / are optimal. 

We can now prove an upper bound on Ai(S) using the next lemma. Take 
0° to be 1. 

Lemma 2. Given some n > 0, functions Fx, . . . , F n : M — > M>o each of 
which is nonzero at most at only finitely many places, and exponents hi, . . . , 
h n > such that hi + ■ ■ ■ + h n > 1, then 

Fi(a) hl ■ ■ ■ Fn{a) hn < Fi(*) hl ■ ■ ■ F n (*) h ". 

Proof. Zero exponents do not contribute to the product on either side, so 
discard them, and assume that hi > for all i. Then, in the case where 
hi = 1, the inequality is Theorem 11 of [2], which is a form of Holder's 
inequality. If ^ hi > 1, the inequality is Theorem 22 of [2], and follows from 
the hi = 1 case together with the inequality 




r > 1, x a > 0. 



□ 

Theorem 3. Given any motif specification on p, L, let H be its hypergraph, 
and let S be a subset of MP with size r. Then Ai(S) < r T *( H \ 

Proof. Let / : IR P — > {0, 1} be the indicator function of S. Fix a motif speci- 
fication. Any motif can then be written as an L by p matrix of coordinates 
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whose entries are variables and constants, where there is one variable for each 
block of each 7Tj, and the constants are the -Dj m 's, placed in the appropriate 
spots of the matrix. For instance, for the motif specification used in the 
introduction, the matrix will be a 4 by 3 matrix containing six variables, as 
shown on the right-hand side of Figure 1. In general, we may suppose that 
we use variables Vi, . . . , v n , and we can then write the number of motifs in 
S as 

M(S)= I(Ri)---HRl), (1) 

i>i,...,u n eiR 

where Ri, . . . , Rl are p-tuples of variables and constants obtained from 
reading off the rows of the matrix. Now, taking the number or 1 and 
raising it to an arbitrary nonnegative power will not decrease it. So, from 
(P), we find that the number of motifs in S is no more than 

Y= I{Ri) 9[X) ---I{Rl) 9{L \ (2) 

vi,...,v n eR 

where g is an optimal fractional transversal of H. For example, using the 
matrix in Figure 1 and the optimal fractional transversal go mentioned above, 
02]) will take the form 

q,s,t,u,v,w£R. 

Now, observe that any given variable in the sum corresponds to an 
edge, e, say, of H, and that the sum of the exponents of the J(i2j)'s in which 
this variable appears equals J2 xee g(x), which, by the definition of a fractional 
transversal, is at least 1. We can therefore apply Lemma [2] to eliminate the 
sum over this variable, replacing it by * wherever it appears. This will not 
decrease the value of (T5]). In the case given in Figure 1, for example, we 
observe that 

]T I(q, u, vf/ 2 I(q, t, w) 1 ' 2 < /(*, u, v) 1 ^*, t, w) 1 ' 2 

and so 

Y < Yl J (*' M ' w ) 1/2/ ( s > *» v) 1/2 I(*, t, w) 1/2 I(s } u, w) 1 ' 2 . 

s,t, u,v,vjSM. 
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If we repeat this operation until all variables are removed, we will find that 



Y < I{R 



■■■i(R' L ) 9{L) , 



where each R[ is a p-tuple each of whose elements is either a constant or *. 
Since each /(-R-) can be no bigger than r, this shows that S has no more 



We can also prove a lower bound which shows that in infinitely many 
cases, this upper bound is best possible, up to a constant factor. 

Theorem 4. Given any motif specification on p, L, let H be its hypergraph. 
Then, for all N > 0, there exists a subset S of MP such that the size, r, of S 
is at least N, and M{S) > {r/Lf^ H \ 

Proof. Let / be an optimal fractional matching of H all of whose values are 
rational. If /(e) = for all edges e of H, then u*(H) = and the theorem is 
trivial, since it reduces to saying that there exist arbitrarily large sets with 
at least one motif. Otherwise, / has at least one positive value. Let d > be 
a common multiple of the denominators of all positive values of /, and write 
down the L by p matrix of coordinates, as we did in Theorem [31 Now, fix a 
positive integer M, and let each variable v in the matrix of coordinates run 
over M d tt e > values, where e is the edge of H corresponding to v. The number 
of values taken on by the row of the matrix corresponding to the vertex x 
will then be M d ^e e /( e ) ; which, by the definition of a fractional matching, is 
at most M d . Therefore, taking the set of all possible rows of the matrix will 
construct a set S with size at most LM d . Also, we must have ^ Ige /(e) = 1 
for at least one vertex x, since if not, we could increase any /(e) slightly and 
still have a fractional matching, contradicting optimality of /. It follows that 
the size of S must be at least M d . Finally, the number of motifs in S is at 
least M d ^/( e ) = M du *( H \ Letting M become large completes the proof. □ 

Remark. Start with the matrix of coordinates used in Theorems [3] and 
HI and make a tableau by leaving the constants in the matrix unchanged, but 
replacing each variable v with the indeterminate Xfr e ), where e is the edge of 
H corresponding to v, and / is some optimal fractional matching of H. Then 
it follows from the proof of Theorem H] that Theorem H] may be improved by 
replacing L with U, where V is the number of distinct rows in the tableau. 



than r T *^ H > motifs, as desired. 



□ 
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3 Uniform motifs and grids 



Suppose that the motif specification contains no constants and that its hy- 
pergraph is uniform, i.e., all blocks in all partitions have the same size, n, 
say. We may then reorder the coordinates of R p so as to place each set of 
identical partitions in the specification into a contiguous range; let there be 
q such ranges, with lengths, in order, p%, . . . , p q (pi + ■ ■ ■ + p g = p.) 

Theorem 5. Let a motif specification on p, L have no constants and a uni- 
form hypergraph, with edge size n, and let its coordinates be ordered as above. 
Then, for all r > 0, the maximum number of motifs in a subset S of MP of 
size r is always r L l n , and this maximum is attained iff S is a grid, i.e., S is 
of the form 

S\ x • • • x S q , where Si C MP' for each i = 1, . . . , q. (3) 

Proof. If iTi = TTj, then coordinates i and j of M p can be lumped together and 
treated as a single coordinate. Doing this repeatedly, we reduce to the case 
where q = p and pi — ■ ■ ■ — p q — 1. 

If we let g have value l/n on each vertex, then g is a fractional transversal 
of the hypergraph H of the motif specification. Also, if we let / have value 
1/p on each edge, then / is a fractional matching of H . However, / and g 
both have weight L/n. It follows that t*(H) = v*{H) = L/n. Now, if we 
let S contain r points which have all coordinates zero except the first, the 
first coordinate ranging over 1, . . . , r, then S will have exactly r L//n motifs. 
Together with Theorem [3], this proves that the maximum number of motifs 
in a set of size r is r L// ™. 

If S is of the form ([3]), then write down the L by p matrix of coordinates 
referred to in Theorem We can obtain a motif in S by, for each i, substi- 
tuting an arbitrarily chosen element of <Sj for each variable in column i. If 
each set <Sj has size (i = 1, . . . , p), it follows that S contains at least 

J] rf /n = ( n r i ) L /" = r L l n 

l<i<p 

motifs. It remains to prove that any set which attains the maximum number 
of motifs for its size is of the form (J3]). 

Suppose that S is a set which attains the maximum number of motifs for 
its size, and let S have indicator function / : 1R P — > M. We may assume that S 
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is nonempty, as otherwise the result is trivial. Now, in the proof of Theorem 
[31 we construct a chain of inequalities starting from Ai(S) and ending at 
r L / n . To have r L// ™ motifs in S, all these inequalities must be equalities. 
Looking at the matrix of coordinates, we see that a new inequality appears 
in the chain whenever we remove a variable and replace it by *. Since we can 
choose to remove the variables in any order, it follows that we must have 



/(i2i) 1/n • • • I(R L ) 1/n = I(R[) 1/n ■ ■ ■ I(R' L ) 1/n , (4) 



whenever R\, . . . , Rl result from taking the rows of the matrix of coordinates 
and substituting each variable by either a real number, *, or the variable q, 
and the R'fi are derived from the RiS by substituting * for q. 

We prove by induction on s that for all s — 0, . . . , p, there is a constant 
C > (namely C = /(*, . . . , *)), and functions Fx, . . . , F s : R — > M>o, each 
zero except at finitely many points, such that 

I(bi,...,b s ,*,...,*) = CF 1 (b 1 )---F s (b s ), for all h, b s G E. (5) 

If s = 0, this is clear, with the given value of C, since there are no functions 
Fi. Otherwise, assume that ([5]) has been proved for s = t — 1. We wish to 
prove it for s = t. Fix some Ci, . . . , c t -\ such that /(ci, . . . , *,...,*) > 0, 
and set 

:= /(c,-,*-,,,, .,-,.) 6 R 

J(ci,...,c t _i, *,*,...,*) 

We now induce secondarily on the number of coordinates j in {1, . . . , t — 1} 
for which bj ^ Cj. Let this number be m. We have already proved ()5]) for s = t 
and m = 0. Suppose that it has been proved for s = t and m = 0, . . . , m' — 1. 
We wish to prove it for s = t and m = m'. Fix some b±, . . . , which differ 
from the q's in exactly m' places. Assume that I(b±, . . . , b t -i, *,...,*)> 0; 
if not, by (jSJ) for s = t — 1, at least one of Fi(bi), . . . , F t -x(bt-i) must be zero, 
so both sides of (jSJ) are zero and we are done. Pick some i G {1, . . . , t — 1} for 
which bi 7^ q, and set bj = bj if j ^ i and j G {1, . . . , i — 1}, fej = q. We now 
fill in the matrix of coordinates with four different possibilities for a row: 



R'{ = 




..,6t_i,g,*,.. 


-,*), 






..,6^,5,*,.. 


-,*), 


R'i = 


(6i,.. 


. . , 6 t _i, *,*,.. 


.,*), and 


R 4 = 


(&;,•• 




■,*)• 
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The rows are determined as follows. By assumption, 7Tj 7^ 71^, so we can find 
some u and v which are in the same block of n t but different blocks of tt^. 
We set R w := R'{ for each row w such that u> is in the same block of TT t as u 
and i> , and w is in the same block of iXi as u\ R w := i?^' f° r each row if such 
that w is in the same block of 7t t as u and but w is not in the same block 
of 7i i as m; R w := R'l for each row w such that to is not in the same block of 
7T t as u and v, but w is in the same block of 7Tj as u\ and i? w := R\ in rows 
w such that w is not in the same block of 7r t as u and t> , and w is not in the 
same block of Tii as u. We then apply @. We observe that by assumption 
and by © for s = t - 1, > and Fj(cj) > for j = 1, . . . , t — 1, so 

/(-R3) > and I(R'l) > 0. We can therefore remove factors of I{R'i) l/n and 
/(i?") 1 /™ whenever they appear on both sides of (jlj), leaving the equality 



E 



[I(R'D x/n I(R'^ n - x)/n = I(R'^ x/n I(R'^ (n - x)/n 
1 

Here, x is the number of indices w such that u and u> are in the same blocks 
of 7Tj and 7T t ; it follows that there are n — x indices w such that u and w are in 
the same block of 7T t but different blocks of 7Tj. By construction, < x < n. 
Then, by Theorem 11 of [2], 

6 f _i,g, *,...,*) = .., 6^,9, *,...,*) for ^ e R 

J(6i,...,6t_i, *,*,...,*) &t_i,*, *,...,*)' 

(6) 

Since (jSJ) has been proved for s = t and m = m! — 1, the right-hand side of 
06]) must equal F t (q). Then from (jSJ) and (jSJ) for s = t — 1 , it follows that (jSJ) 
holds for s = t with the given & 1; . . . , 6 t _ 1; and b t = q arbitrary Since & 1( 
. . . , were chosen arbitrarily with m = m', this completes the secondary 
induction, and hence the overall induction. 

We conclude that for some constant C > and functions F\, . . . , F p : R — > 
R>o, we have 

!(&!, . . . , 6 P ) = CF^fei) • • • F p (b p ), for all 61 , . . . , b p e R. 
Therefore, 5 is a grid, as desired. □ 



4 Single-starred motifs 

We now discuss another case where we can determine the asymptotic behav- 
ior of the maximum number of motifs in a set of given size. Call a motif 
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specification on p, L single- starred if it contains no constants, and, for each 
i = 1, . . . , L, there exists a unique j such that ttj contains the block {i}. 
In this case, any fractional transversal g of the hypergraph H of the speci- 
fication must have g(i) > 1 for all i — 1, . . . , L; since this condition is also 
sufficient to make g a fractional transversal, we see that r*(H) = L and that 
an optimal fractional transversal must have g(i) — 1 for all i. 

For a single-starred specification, write down the matrix of coordinates 
used in Theorems |3] and HJ and make a tableau by replacing each variable 
corresponding to a block of size 1 with *. By assumption, there will be 
exactly one * in each row. Each assignment v of values to the remaining 
variables then determines fi(v), . . . , fhiy) in (R U {*}) p , given by reading 
off the rows of the tableau. Let k be the number of variables remaining. By 
choosing an ordering of the variables, we may think of v as being a member 
ofR k . 

Let Mi(L) := L(L L + 1), and let M(L) := (2M 1 (L)) L - 1 L 2 + 1; we have 
M(L) > Mi(L) for all L. We now make the following definitions: 

• A point v G M k is a center of a set S C MP with size r and indicator 
function / : W {0, 1} if at least L - 1 of J(/i(«)), • • • , I(h(v)) are 
at least r/M(L). 

• A point v G M. k is a hypercenter of a set 5 C ]R P with size r and 
indicator function / : W -> {0, 1} if all of I(fi(v)), . . . , I(f L (v)) are 
at least r/M^L). 

• A point u> G M p is m /me mt/i a point v G M fc if its coordinate vector 
is of the form fi(v), for some i G {1, . . . , L}. 

It follows that every hypercenter of S is also a center of S. 

For a single-starred motif specification, let C\, . . . , C g be the columns of 
the tableau where a * occurs, and let a\ > 0, . . . , a q > be the number of 
*s occurring in columns C\, . . . , C q , respectively. Observe that q < L and 
that oil -| — • + a q — L. Set 



We have C > L~ L . 

Lemma 6. For each single- starred motif specification onp,L, there is some 
function <p such that <p(r) — > as r — )■ oo and such that, for any r, there 
exists a set S CMP of size r with at least C(l + 0(r))r L motifs. 
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Proof. For any nonnegative integer N, we can construct a set S of size LN 
by, for each i = 1, . . . , q, taking otiN points which have all coordinates 
except their Cjth, which ranges over {1, . . . , aiN}. There are then at least 

( ai N) ai ■ ■ ■ (a q N) a « = C(LN) L 

motifs in S. To construct a set of arbitrary size, r say, we can set N := \r / ' L\ 
and pad out S until we get a set of size r with at least C(L\r / L\) L motifs. 
Since (L[r/L\)/r — > 1 as r — > oo, this completes the proof. □ 

Lemma 7. For each single-starred motif specification on p, L, there exists ro 
such that whenever r > r and S C M p of size r has the maximum number 
of motifs possible for a set of its size, then S has a hypercenter. 

Proof. Let / : R p — > {0, 1} be the indicator function of S. Then 

M(S)=J2nh(v))---I(h(v)). (7) 



If S has no hypercenter, we know that for each v, there is some i with 
I(fi( v )) — r /Mi(L), so (JTj) can be no more than 



v ' l<i<L v m k j& 



However, each variable must occur at least twice in the tableau, so for each 
i, after deleting row i, each variable must still occur at least once. It follows 
that for each i, 

EII J ^( ,, ))^ J (*'-'*) £ " 1 = r£ " 1 ( 8 ) 

and therefore S has no more than Lr L /Mi(L) motifs. However, by Lemma 
El there exists a set of size r with at least L~ L (1 + 0(r))r L motifs, and if 
we pick ro large enough, we will have L _L (1 + 4>(r)) > (1 + L L )~ 1 for all 
t > To. This contradicts the assumption that S had the maximum number 
of motifs. □ 

Lemma 8. For each single-starred motif specification on p, L, there exists r 
such that whenever r > r and S C.W of size r has the maximum number of 
motifs possible for a set of its size, then all points in S are in line with some 
center. 
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Proof. If L = 1, all points in M. k are centers, so the result is trivial. We may 
therefore assume that L > 2. Let I : M p — > {0, 1} be the indicator function 
of S. If we remove one point, w, from S, then I(fi(v)) does not decrease 
if the coordinate vector of w is not of the form fi(v), and decreases by 1 if 
the coordinate vector is of the form fi(v). Therefore, after removing w, the 
number of motifs in S can decrease by at most 

Y --= E EII J (/i(«)). 

i<i<L ^gt; jV« 

where 71 is the set of points v G M fc for which u> is of the form fi(v). If w is 
not in line with any center, there are no centers in 7i U . . . U 72, so for any i 
and v e 71, there must be some z' 7^ i with 7(fv(t>)) < r/M(L). Therefore, 

y^j^E] E En '(AM)- 

However, when we sum over f G 71, we are only summing over variables which 
do not occur in the ith row of the tableau, so after deleting the zth and i'th 
rows, each variable summed over must still occur at least once. Therefore, 



L-2 _ r L-2 



and so we have decreased the number of motifs in S by at most (L 2 /M(L))r . 
Now, if we take ro to be at least as large as in Lemma [TJ then S must have 
a hypercenter, t> , say, and after deleting w, each of I(fi(vo)), ■ ■ ■ , i"(/z,(i>o)) 
must be at least (r/Mi(L)) — 1. If we take r to be at least as large as 
2Mi(L), adding w back to be in line with t> w ih then increase the number 
of motifs in S by at least {(r/M x (L)) - l) z_1 > (r/(2M 1 (L))) L " 1 . Since 
M(L) > (2Mi(L)) L_1 L 2 , this means that moving u> to be in line with a 
hypercenter increases the number of motifs in S. This contradicts the as- 
sumption that S had the maximum number of motifs. □ 

Lemma 9. For each single-starred motif specification on p, L, a set S can 
have at most LM{V) L ~ X centers. 

Proof. For each center v, there is some i such that I(fj(v)) > r/M(L) for 
all j 7^ i. Fix i. Then, for each j 7^ i, if we sum I(fj(v)) over the variables 
occurring in the jth row of the tableau, we will get r. Therefore, there can 
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be at most M(L) assignments of values to the variables in the jth row of the 
tableau for which I(fj(v)) > r/M(L). Recalling that, after deleting row i, 
each variable must still occur at least once, it follows that after fixing i, there 
are at most M(L) L ~ l possibilities for v. S therefore has at most LM(L) L_1 
possible centers. □ 

Theorem 10. For each single-starred motif specification on p, L, as r be- 
comes large, the maximum number of motifs in a set S C. MP of size r is 
asymptotic to Cr L . Also, there is some r such that if r > r , then a set S of 
size r with a maximum number of motifs for its size must have exactly one 
center, which must also be a hypercenter, and all points in S must be in line 
with the center. In addition, the points in S must fall into q lines, i\, . . . , 
£ q , say, where each ^ is given by fixing all coordinates except the Cith. 

Proof. We have already proved that Cr L is an asymptotic lower bound in 
Lemma El We now need to prove that it is also an asymptotic upper bound, 
and that optimal sets S have the desired form. 

Let r be at least as large as required in Lemmas [7] and El and let S be a 
set of size r > r with a maximum number of motifs for its size. As usual, 
the number of motifs in S is 

£/(/i(*0) •••%(«))■ 

v£R k 

We may bound this sum above by the sum of L+l pieces: the first, Z±, will be 
given by summing over all v where f\{v) ^ fi{v') for all centers v' and i = 1, 
. . . , L, . . . , the Lth, Z L , by summing over all v where /z,(v) ^ fi(v') for all 
centers v' and i = 1, . . . , L, and the last, Z L+1 , is given by summing over all 
v where, for each j — 1, . . . , L, there is some center v' and i G {1, . . . , L} for 
which fjiy) = fi{v'). To bound the first piece, observe that if f\{v) ^ fi(v'), 
then there are either no points whose coordinate vectors are both of the form 
fi(v) and fi(v') (in the case where the * is in the same position in fi(v) and 
fi(v')) or at most one point (in the case where the * is in different positions 
in f\(v) and in /j (?/).) Also, by Lemma [U every point in S is of the form 
fi{v'), for some i and some center v'. Then, summing over i and v' and using 
Lemma El we see that I(fi(v)) < L 2 M(L) L_1 , so by (jSJ), the first piece of 
the sum is no more than L 2 M(L) L ~ 1 r L ~ 1 . We can bound the 2nd through 
Lth pieces in the same way, so 

M(S) < Zt + ■ ■ ■ + Z L+1 < Z L+1 + L 3 M(L) i - 1 r L - 1 . (9) 
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For j = 1, . . . , L, let <Tj G {1, . . . , q} be such that there is a * in row j 
and column C a . of the tableau, and for each i = 1, . . . , q, let <Sj be the set of 
points of S whose coordinate vectors are of the form fj(v), for some center 
v of S, and some j with Uj = i. For i — 1, . . . , g, we let Si have indicator 
function Ii and size fj. By Lemma [8], all points in S must be in line with some 
center, so S = Ui<i< q Si. Now, if there is a point w which is in <Sj PI cty, for 
some i ^ i', then w is of the form fj(v) and also of the form fj>(v'), for some 
centers v and v' of 5. Every coordinate of w except the Cjth is determined 
by v and j, and every coordinate of w except the CVth is determined by v' 
and j'. Then, since i ^ i', w is determined by v, v', j, and j'. Since S can 
have at most LM[V) L ~ X centers, there are at most L 4 M(L) 2L ~ 2 possibilities 
for w. Therefore, 

ri + ■ ■ ■ + r q < r + L 5 M(L) 2L - 2 . (10) 
However, by the definition of Zi + i, 

< Y,I n (Ji(v))..-I at (f L (v)), (11) 

v£R k 

and we have the obvious inequality 

where, for each i, V{ is summed over all elements of (R U {*}) p which con- 
tain exactly one *, in coordinate C a .. However, the right-hand side of this 
inequality is just 

n r - = n r 7 

l<i<L l<j<q 

and therefore from (TIT!) and ffl2l) we get 

z L+l < n <*? n (nM**- ( i3 ) 

l<j<q l<i<9 

The asymptotic upper bound Cr L now follows immediately from (Q, fflQj) , 
(fT3|), and Theorem 9 of gj. 

By Lemma d we already know that S has a hypercenter, vq, say, and by 
Lemma [U we know that all its points are in line with some center. It remains 
to show that S has only one center, and that its points fall into q lines as 
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claimed. However, if S had a center, v', say, not equal to v , then v' and 
v would have to assign some variable a different value. Since each variable 
occurs in at least two rows of the matrix of coordinates, there must therefore 
be some i for which I(f i (v r )) > r/M(L) and fi(v r ) ^ fi(vo). Now, the term 

hAhM) ■ ■ -V, (A-i(«b))^(/i(tO)W(A+i(«b)) • •■^(/tW) 

will occur on the right-hand side but not the left-hand side of (j!2p . and it 
will have value at least (r / 'M(L))(r / 'Mi(L)) i_1 ; this means that the left- 
and right-hand sides of ([13]) must differ by at least r L /(M(L)Mi(L) L_1 ). By 
our asymptotic estimate, this will prevent S from containing the maximum 
possible number of motifs, if we take sufficiently large. We conclude that S 
must have only one center, vq. Finally, if there are % and j for which <Ji = aj 
but fi(v ) ^ fj{v ), the term 

J<n(/l(«o)) • ••^- 1 (/i-l(«o))^ i (/i(vo))^T <+ i(/i+l(«o)) ■ "^(/iW) 

will occur on the right-hand side but not the left-hand side of (1T2"1) . and it 
will have value at least (r/Mi(L)) L . As before, this will prevent 5 from 
containing the maximum possible number of motifs. Therefore, we conclude 
that the set {/i(fo), • ■ • , /l(^o)} has only q distinct elements, one with a 
* in position C, for i = 1, . . . , q. Then, since every point in S must be 
in line with the unique center, Vq, every point in S must have coordinate 
vector matching one of these q distinct elements. This establishes the only 
remaining claim. □ 



5 Conclusion 

To finish, we ask two questions: 

1. For a general motif specification, what is the asymptotic behavior of 
the maximum number of motifs in a set of given size? 

2. For motif specifications with no constants and uniform hypergraph, 
sets with the maximum number of motifs for their size are always grids. 
Also, for single-starred motif specifications, large enough sets with the 
maximum possible number of motifs for their size are always the union 
of q lines. Are large sets with the maximum possible number of motifs 
for their size always unions of a small number of grids? 
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